Comparison and Adaptation of Automatic Evaluation Metrics for Quality Assessment of Re-Speaking
نویسندگان
چکیده
Re-speaking is a mechanism for obtaining high quality subtitles for use in live broadcast and other public events. Because it relies on humans performing the actual re-speaking, the task of estimating the quality of the results is non-trivial. Most organisations rely on humans to perform the actual quality assessment, but purely automatic methods have been developed for other similar problems, like Machine Translation. This paper will try to compare several of these methods: BLEU, EBLEU, NIST, METEOR, METEOR-PL, TER and RIBES. These will then be matched to the human-derived NER metric, commonly used in re-speaking.
منابع مشابه
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
متن کاملGoogle Scholar journal metrics: Comparison with impact factor and SCImago journal rank indicator for nuclear medicine journals
Introduction: In the current study, we compared h5-index provided by Google Scholar (GS), impact factor (IF) provided by web of sciences (WOS), and SCImago journal rank indicator (SJR) provided by SCOPUS for quality assessment of nuclear medicine journals. Methods: 2013 h5-index, 2012 IF, and 2011 SJR of nuclear medicine journals were extracted from their publishers namely GS, WOS, and SCOPUS....
متن کاملTags Re-ranking Using Multi-level Features in Automatic Image Annotation
Automatic image annotation is a process in which computer systems automatically assign the textual tags related with visual content to a query image. In most cases, inappropriate tags generated by the users as well as the images without any tags among the challenges available in this field have a negative effect on the query's result. In this paper, a new method is presented for automatic image...
متن کاملPatient Assessment of Constipation Quality of Life Questionnaire: Translation, Cultural Adaptation, Reliability, and Validity of the Persian Version
Background: The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and ...
متن کاملA Robust SAR NLFM Waveform Selection Based on the Total Quality Assessment Techniques
Design, simulation and optimal selection of cosine-linear frequency modulation waveform (CNLFM) based on correlated ambiguity function (AF) method for the purpose of Synthetic Aperture Radar (SAR) is done in this article. The selected optimum CNLFM waveform in contribution with other waveforms are applied directly into a SAR image formation algorithm (IFA) and their quality effects performance ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Science (AGH)
دوره 18 شماره
صفحات -
تاریخ انتشار 2017